Search CORE

3 research outputs found

Text to image synthesis for improved image captioning

Author: Bennamoun M.
Hossain Md.Z.
Laga H.
Shiratuddin M.F.
Sohel F.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2021
Field of study

Generating textual descriptions of images has been an important topic in computer vision and natural language processing. A number of techniques based on deep learning have been proposed on this topic. These techniques use human-annotated images for training and testing the models. These models require a large number of training data to perform at their full potential. Collecting human generated images with associative captions is expensive and time-consuming. In this paper, we propose an image captioning method that uses both real and synthetic data for training and testing the model. We use a Generative Adversarial Network (GAN) based text to image generator to generate synthetic images. We use an attention-based image captioning method trained on both real and synthetic images to generate the captions. We demonstrate the results of our models using both qualitative and quantitative analysis on popularly used evaluation metrics. We show that our experimental results achieve two fold benefits of our proposed work: i) it demonstrates the effectiveness of image captioning for synthetic images, and ii) it further improves the quality of the generated captions for real images, understandably because we use additional images for training

Research Repository

A comprehensive survey of deep learning for image captioning

Author: Hossain MD.Z.
Laga H.
Shiratuddin M.F.
Sohel F.
Publication venue: ACM Digital Library
Publication date: 01/01/2019
Field of study

Generating a description of an image is called image captioning. Image captioning requires recognizing the important objects, their attributes, and their relationships in an image. It also needs to generate syntactically and semantically correct sentences. Deep-learning-based techniques are capable of handling the complexities and challenges of image captioning. In this survey article, we aim to present a comprehensive review of existing deep-learning-based image captioning techniques. We discuss the foundation of the techniques to analyze their performances, strengths, and limitations. We also discuss the datasets and the evaluation metrics popularly used in deep-learning-based automatic image captioning

Research Repository

Alkyl and Alkoxyl Monolayers Directly Attached to Silicon: Chemical Durability in Aqueous Solutions

Author: Ara M.
Ara M.
Asanuma H.
Bergerson W. F.
Boukherroub R.
Boukherroub R.
Buriak J. M.
Buriak J. M.
de Smet L.C.P.M.
Effenberger F.
Eves B. J.
Fabre B.
Faucheux A.
Gorostiza P.
Hacker C. A.
Hossain Md.Z.
Kim N. Y.
Kurokawa S.
Leftwicha T. R.
Lehner A.
Linford M. R.
Linford M. R.
Lua Y.-Y.
Moses P. R.
Niederhauser T. L.
Osa T.
Pei Y.
Porter M. D.
Sagiv J.
Saito N.
Sano H.
Sato Y.
Sieval A. B.
Snyder R. G.
Stewart M. P.
Sugimura H.
Sun Q.-Y.
Sun Q.-Y.
Tajimi N.
Ulman A.
Wasserman S. R.
Wayner D. D. M.
Zhang L.
Publication venue: 'American Chemical Society (ACS)'
Publication date
Field of study

Crossref